Comparison of Perceived Prosodic Boundaries and Global Characteristics of Voice Fundamental Frequency Contours in Mandarin Speech
نویسندگان
چکیده
Although there have been many studies on the prosodic structure of spoken Mandarin as well as many proposals for labeling the prosody of spoken Mandarin, the labeling of prosodic boundaries in all the existing annotation systems relies on auditory perception, and lacks a direct relation to the acoustic process of prosody generation. Besides, perception-based annotation cannot ensure a high degree of consistency and reliability. In the present study, we investigate the phrasing of spoken Mandarin from the production point of view, by using an acoustic model for generating F0 contours. The relationship between perceived prosodic boundaries at various layers and phrase commands derived from the model-based analysis of F0 contours is then revealed. The results indicate that a perception-based prosody labeling system cannot describe the prosodic structure as accurately as the model for F0 contour generation.
منابع مشابه
بررسی برخی ویژگی های آکوستیک گفتار نوزاد مدار در مادران فارسی زبان
Introduction: When adults talk to another person, linguistic characteristics of the listener will also be considered. A clear example of speech changes depending on the listener is maternal or infant directed speech. Infant directed speech is more slowly with longer sentences and pauses at the end of the utterance. Undoubtedly the most distinctive feature of this style of speech is acoustic c...
متن کاملMandarin Tones Recognition by Segments of Fundamental Frequency Contours
Mandarin is one of the tonal languages. In Mandarin tones, there are four lexical tones (tone 1 to tone 4) with four different fundamental frequency (f0), such as flat and high, rising, falling and then rising, and falling, respectively. In order to process signal on lexical tone, at first we have to identify which tone is. We would like to find out an efficient approach to identify Mandarin to...
متن کاملQuantitative analysis of F0 contours of emotional speech of Mandarin
For emotional speech synthesis, a quantitative model giving a parametric representation of F0 contours is needed. Purpose: investigate quantitatively F0 characteristics of Mandarin speech in four basic emotions (anger, fear, joy, and sadness) and in neutral reading. Two approaches are compared: surface features analysis from time-normalized F0 contours analysis-by-synthesis of time-intact F0 co...
متن کاملUse of Prosodic Features in Speech Recognition
Two methods were proposed for the use of prosodic features in speech recognition: one to detect major syntactic (phrase) boundaries as the initial phase of speech recognition, and the other to check the feasibility of the results of ordinary recognition process from the viewpoint of prosodic features. In the rst method, fundamental frequency contours were assumed as waveforms as functions of ti...
متن کاملA method of representing fundamental frequency contours of Japanese using statistical models of moraic transition
A statistical modeling of voice fundamental frequency contours was proposed for the purpose of developing effective ways to utilize prosodic features in speech recognition. In view of the fact that prosodic features should be treated in longer units, the proposed modeling represents the transition in moraic units. A fundamental frequency contour was rst segmented into moraic units and then each...
متن کامل